A NOTE ON THE GROUP-THEORETIC APPROACH TO 
FAST MATRIX MULTIPLICATION 



IVO HEDTKE 

Abstract. In 2003 Cohn and Umans introduced a group-theoretic approach to fast 
matrix multiplication. This involves finding large subsets S, T and J7 of a group G 
satisfying the Triple Product Property (TPP) as a means to bound the exponent u of 
the matrix multiplication. We show that S, T and U may be be assumed to contain 
the identity and be otherwise disjoint. We also give a much shorter proof of the upper 
bound IS*! + \T\ + \U\ < \G\+2. 

1. Introduction 

The naive algorithm for matrix multiphcation is an 0{n^) algorithm. From Volker 
Strassen ([5]) we know that there is an (9(?7,^'^^) algorithm for this problem. WlNO- 
GRAD optimized Strassen's algorithm. While the Strassen- Winograd algorithm 
is the variant that is always implemented (for example in the famous GEMMW pack- 
age), there are faster ones (in theory) that are impractical to implement. The fastest 
known algorithm runs in C(r2,^-^^^) time (see [3] from DON COPPERSMITH and Shmuel 
Winograd). Most researchers believe that an optimal algorithm with 0{n'^) runtime 
exists, but since 1987 no further progress was made in finding one. 

Because modern architectures have complex memory hierarchies and increasing par- 
allelism, performance has become a complex tradeoff, not just a simple matter of 
counting flops. Algorithms which make use of this technology were described in [1] 
by D 'Alberto and Nicolau. An also well known method is Tiling: The normal 
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algorithm can be speeded up by a factor of two by using a six loop implementation that 
blocks submatrices so that the data passes through the LI Cache only once. 

In 2003 CoHN and Umans introduced in [2j a group-theoretic approach to fast matrix 
multiplication. The main idea is to embed the matrix multiplication over a ring R into 
the group ring RG, where G is a (finite) group. A group G admits such an embedding, 
if there are subsets S, T and U which fulfill the so called Triple Product Property. 

Definition (Right Quotient). Let G be a group and 7^ X C G be a nonempty subset 
of G. The right quotient Q{X) of X is defined by Q{X) := {xy^^ : x,y E X}. 

Definition (TPP). We say that the nonempty subsets S, T, and {/ of a group G fulfill 
the Triple Product Property (TPP) if for s G Q{S), t G Q{T) and u G Q{U), stu = 1 
holds iSs = t = u = l. 

CoHN and Umans found a way to bound the exponent u of the matrix multiplication 
with their framework. Therefore, for a fixed group G we search for TPP triples S, T and 
U which maximize IS"! ■ |r| ■ \ U\, for example with a brute force computer search. Here 
one can use Murthy's upper bound (s. Corollary [6]) and our intersection condition (s. 
Theorem [1]) . 

2. Results 

We show that S, T and U may be be assumed to contain the identity and be otherwise 
disjoint. 

Theorem 1. If S' , T' and U' fulfill the TPP, then there exists a triple S, T and U 
with 

\S\ = \S'\, |r| = |T'|, \U\ = \U'\ and s nT = T nu = s nu = 1 
which also fulfills the TPP. 
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For the proof of our main result we need some auxiliary results. 

Lemma 2. Let (I) X C G be a nonempty subset of a group G and g E G. Then 

(1) 1 G QiX), 

(2) g G Q{X) ^ g-^ G Q{X) and 

(3) \X\<\Q{X)\. 

Proof. (1) Because X 7^ there exists an x G X and so 1 = xx^-^ G Q{X) follows. 

(2) If g E Q{X) then there are x,y E X with g = xy~^. This implies, that 
g-^ = (xy-^)-^=yx-^eQ{X). 

(3) For a fixed x E X the map X — )■ Q(X), y 1— )■ yx~^ is injective and therefore 
\X\ < \Q{X)\ holds. □ 

Lemma 3. // S , T and U fulfill the TPP then 

Q{X)nQ{Y) = l 

holds for allX^Y e {S, T, U}. 

Proof. We know 1 G Q(X)nQ(y) from Lemma[2](l). Now assume that \Q{X)nQ{Y)\ > 
2. In this case there is an 1 7^ s G Q{X) fl Q{Y). From Lemma [2t^ 2) we know, that 
x~^ G Q{X)r\Q{Y), too. Moreover 1 is an element of every right quotient and therefore 
the factors x, x~^ and 1 occur in {stu : s G Q{S),t G Q(T),u G Q{U)} and the TPP is 
not fulfilled. So we have \Q{X) fl Q(Y)\ = 1 which completes the proof. □ 

Theorem m and Corollary E] below are originally due to Murthy (2009). Our proofs 
are somewhat shorter. 

Theorem 4 (Murthy's minimal disjointness property). If S, T and U fulfill the TPP 
then 

\xr\Y\<i 

holds for allX^Y e {S, T, U}. 
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Proof. Assume that |X fl y| > 2. Then there are x ^ y E X CiY. Therefore we have 
1 7^ xy^^ G Q{X) n Q{Y). This is a contradiction to Lemma [3l □ 

Now we can prove our main result. 

Proof of Theorem{^ We fix sq E S', to E T' and Uq E U' . Now we define S := {sSq^ : 
s E S'} and T and U in the same way. Obviously \S\ = \S'\, \T\ = \T'\ and \U\ = \U'\ 
holds. Because of 

Q{S) = {ss-^ ■.s,seS} = {ssq\ssoY' ■.s,seS'} = {sr^ ■.s,seS'} = Q{S'), 

Q{T) = Q{T') and Q{U) = Q{U') the triple S, T and U fulfill the TPP, too. It is also 
clear, that 1 E S , 1 E T and 1 eU . The result now follows from Theorem HI □ 

Finally we can prove the upper bound of \G\ +2 for the additive size of a TPP triple. 
Theorem 5. If S , T and U fulfill the TPP then \Q{S)\ + \Q{T)\ + \Q{U)\ < \G\ + 2. 

Proof. Note that Q{S) U Q{T) U Q(U) C G and 
\Q{S)UQ{T)UQ{U)\ 

= \Q{S)\ + \Q{T)\ + \Q{U)\ - \Q{S) n Q{T)\ - \Q{T) n Q{U)\ - \Q{S) n Q{U)\ 
+ \Q{S)nQ{T)nQ{U)\. 

Because of Lemma [3] all intersections have size 1 and the statement follows. □ 
Corollary 6 (Murthy). IfS, T and U fulfill the TPP then \S\ + \T\ + \U\ < \G\ + 2. 

Proof. The statement follows from Lemma [2](3) and Theorem [5l □ 

Note that Theorem [5] is more effective than Corollary [6] when searching for TPP 
triples. 
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